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ABSTRACT 

A new model for the luminosity distribution in the inner Milky Way is found, using 
a non-parametric penalized maximum-likelihood algorithm to deproject a dereddened 
COBE/DIRBE L-band map of the inner Galaxy. The model is also constrained by the 
apparent magnitude (line-of-sight) distributions of clump giant stars in certain bulge 
fields. An important new feature is the inclusion of a spiral arm model in the disk. 

Spiral arms make the model appear broader on the sky, thus our bar is more 
elongated than in previous eight-fold symmetric models. They also lead to a smoother 
disk model interior to the Sun. The bar length is w3.5 kpc and its axis ratios are 1:(0.3- 
0.4) :0. 3, independent of whether the spiral arm model is 4-armed or 2-armed. The 
larger elongation in the plane makes it possible to reproduce the observed clump giant 
distributions as well. With only the surface brightness data a small model degeneracy is 
found even for fixed orientation of the bar, amounting to about ±0.1 uncertainty in the 
in-plane axial ratio. Including the clump giant data removes most of this degeneracy 
and also places additional constraints on the bar's orientation angle. We estimate 
15 deg < (/Pbar ^ 30 deg, with the best models obtained for 20 deg < iphar ^ 25 deg. 

We use our reference model to predict a microlensing optical depth map towards 
the bulge, normalising its mass by the observed terminal velocity curve. For clump 
giant sources at {l,b) — (3.9 deg, —3.8 deg) we find r_6 = r/10~^ = 1.27, within 1.8a 
of the new MACHO measurement given by Popowski et al. The value for all sources at 
{I, b) = (2.68 deg, -3.35 deg) is r_6 = 1.1, still > '3a away from the pubhshed MACHO 
DIA value. The dispersion of these r^g values within our models is ~ 10%. Because the 
distribution of sources is well-fit by the NIR model, increasing the predicted optical 
depths by > 20% will be difficult. Thus the high value of the measured clump giant 
optical depth argues for a near-maximal disk in the Milky Way. 

Key words: Galaxy: structure - Galaxy: centre - Galaxies: spiral. 



1 INTRODUCTION 

Observations of the Milky Way [MW] show significant sys- 
tematic differences between the NIR surface brightness of 
t he M W at / > Odeg a nd I < Odeg (e.g.. Blit z fc Spergel 
([L991[), Weiland et al. (|l994|), Bissantz et al. ([L997|)). It is 



widely accepted that these variations reflect the fact that the 
MW is a barred spiral galaxy. Evidence for a barred compo- 
nent of the luminosity density in the inner M W also comes 
from starcount ob serva tions (e.g., S tanek et al. (L997), Niko- 
laev & Weinberg (1997), Sevenst er (|1999 ), Hammersley et al. 
(1999), Lopez-Corredoira et al. ( 200 ( TT fro m gas- dynamics 
(e.g., Englma ier fc Gerhard ( |l999| ), Fux (|l999| ), Weiner 
& Sellwood ( 1999[ )), and microlensing observations (e.g., 
Paczynski et al. ( |1994| ), Zhao, Rich & Spe rgel (|l996| )). Fur- 



ther references can be found in Gerhard (( 200l| 



The starcount data show significant asymmetries be- 
tween lines-of-sight that are symmetrical with respect to the 
/ = axis; this is the signature of a bar with its near end 
at positive Galactic longitudes. Most importantly, starcount 
data contain information about the distances to the surveyed 
stars. This is complementary to the all-sky coverage of sur- 
face brightness maps, and is valuable for constraining the 
line-of-sight structure of the bulge even if available only for 
a restricted number of flelds. In this paper we will take one 
step towards combining the information from both kinds of 
d ata, a n d use the clump giant observations of Stanek et al. 
((|1994|), (|1997|)) together with the COBE/DIRBE NIR data 



to determine a model for the luminosity distribution in the 
inner Galaxy. With this model we can be more confident 
about the line-of-sight distribution (LOS) of microlensing 
sources, and are thus in a much better position to predict mi- 
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crolensing optical depths for comparison with the re cent de - 
terminations fro m the MACHO group (Alcock et al. (2000a) 
Popowski et al. (|2000|)). 



Most previous models of the inner MW have been para- 
metric and are thus restricted towards certain classes of den- 



sities for the bulge and/or disk. Binney & Gerhard (199(:) 



developed a nonparametric approach to the deprojection of 
the COBE/DIRBE data based on the Richardson-Lucy al- 
gorithm, in which by construction the luminosity models are 
eight-fold symmetric with respect to the three main planes of 
the bar /bu lge. M odels constructed with this approach (Bin- 
ney et al. (L997); Bissantz et al. (1997)) give a good fit to 
the COBE/DIRBE L-band data, but predict less asymmet- 
ric LO S dist ributions towards the fields observed by Stanek 
et al. (1994) than observed, by more than 0.1'". Eight-fold 
symmetry also excludes modeling the s piral arms of the MW 
( see, e .g., Englmaier & Gerhard (199!:), Drimmel & Spergel 
(2001)). In the present paper we describe a non-parametric 



penalized likelihood approach to infer the luminosity den- 
sity of the inner MW from the COBE/DIRBE data which 
allows us to include a spiral arm model. 

This paper is organized as follows. Section 2 describes 
our new deprojection algorithm. In Section 3 we test the 
method with known parametric distributions and analyse 
the uniqueness of the deprojected bar shape. In Section 4 
we present models for the luminosity distribution of the 
MW which are consistent with both the COBE/DIRBE L- 
band data and the observed asymmetry in the distribution 
of clump giant stars, and give constraints on the orientation 
angle of the Galactic bar. In Section 5 we predict the mi- 
crolensing optical depths for these models and compare to 
recently published results of the MACHO experiment. We 
close with a summary and conclusions in Section 6. 



2 MAXIMUM LIKELIHOOD DEPROJECTION 
METHOD 

In this Section we describe the technique we have used to 
construct models for the Milky Way's luminosity distribu- 
tion. It is a non-parametric technique that maximizes a like- 
lihood function, which includes penalty terms encouraging 
smoothness, eight-fold (triaxial) symmetry and a spiral arm 
component in the model. The minimisation procedure is iter- 
ative, starting from an initial parametric model. The follow- 
ing subsections describe the initial parametric models (^1), 
the algorithm (§^.2), the choice of optimal penalty parame- 
ters ([|2-3) ^-nd the performance of the algorithm (§^.4). The 
results of using the algorithm to recover known solutions 
from artificial data are described in sfl 



2.1 Parametric models 

We define parametric models for the luminosity distribution 
of the MW on a cartesian grid. The coordinate system has 
the Galactic centre at its origin. The axes are parallel to the 
main axes of the bar. In this coordinate system the position 
of the Sun is (a; = iiocos(v3bar), y = ^osin(<^bar), Zq), where Ro 
is the distance of the Sun from the Galactic centre projected 
onto the main plane of the Milky Way, Zq the position of 
the Sun above the xy-plane, and </5bar the "bar angle", i.e., 
the angle in the xy-plane between the major axis of the bar 



and the projected line-of-sight from the Sun to the Galactic 
centre, such that for positive (ySbar the near end of the bar 
is at positive longitudes. Throughout this paper we will use 
-Ro ~ 8kpc and Zq = 14 pc. 

Our parametric models contain a double-exponential 
disk and a truncated power-law bulge (cf. Binney, Gerhard 
& Spergel (|l997|)): 

(1) 



p(5) = Pd{x) + pt{x), 



where 



Pd 



Pb 



Pd- Rd- e 



2 I Z 

-a /a 



+ a- 



H — T + 7^1 -R = v a;^ -I- j/^ and x- 



In some models we also include an additiona l spir al arm 
component. This is taken from Ortiz & Lepine (199J), who 
obtained a good fit to the tangent directions seen in in- 
frared star counts with four loga rithmic spirals. See Table 
1 of Englmaier & Gerhard ( 1999 ) for other tracers of these 
tangent directions. The positions of the spiral arms ri{(f>) 
(i = 1, . . . , 4) are given by 

= 2.33kpc- e'"*-^''--"^')-*""^^', 

where the angle (pi — 0, n/2, n, 3n/2 determines the starting 
angle of a spiral arm in galactocentric coordinates with re- 
spect to the major axis of the bar, and x = 13.8 deg is the 



pitch angle of the arms (Ortiz & Lepine (1993)). We use this 



4-armed logarithmic spiral in the range between an inner ra- 
dius of 3.5 kpc and an outer radius of approximately 10 kpc. 
We do not ensure a smooth transition to the bar in the para- 
metric model. The spiral arms are modelled with a Gaussian 
profile with FWHM usually « 300 pc, again after Ortiz & 
Lepine, but we have also computed models with ~ 500 pc, 
without improving the fit to the data as described below. 
In our parametric models we treat this spiral arm model as 
an enhancement of the density of our standard disc model, 
keeping all the above spiral arm parameters fixed and vary- 
ing only the amplitude ds of the density modulations: 



includiiiK spiral 
Pd =Pd 



n 



l+ds 



- lii(2)-Ar^/(0.5'FWHM)'' 



where Ar^ is the (approximate) distance to the nearest point 
on spiral arm i. The projected density is matched to the 
COBE/DIRBE L-band data and the best fit parameters are 
found with our implem entati on of the Marquardt-Levenberg 
algorithm (Press et al. (1994)), now with ds as an additional 
fit parameter. These parametric best-fit models (as a func- 
tion of the bar angle v'bar) are used both as initial models 
and to define a spiral arm bias term in the penalty func- 
tion (see below) in the non-parametric deprojection of the 
COBE/DIRBE L-band data. 

2.2 The algorithm 

Our approach is non-parametric; the idea is to maximize a 
likelihood function which includes penalty terms encourag- 
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Figure 1. The eight-fold symmetry term in the penalty function 
encourages symmetry of the density function with respect to the 
mirror planes shown in this figure. Coordinate directions are in 
the bar frame. 



ing smoothness, eight-fold symmetry and a spiral structure 
close to the imposed four-armed pattern. Thus also the bar 
by itself is not forced to obey eight-fold symmetry, but will 
be nearly triaxially symmetric as far as allowed by the data 
and the other constraints. 

For the technical realization, the model density is de- 
fined on a cartesian grid. Stepsizes are identical in x and 
y. The stepsize in z is smaller than for x and y because we 
expect the most rapid spatial change of the density along 
z. The "standard" grid consists of 60 x 60 x 41 points cov- 
ering a box of (— 5kpc<a;<5kpc) x (— 5 kpc< j/< 5 kpc) x 
(—1.5 kpc< z < 1.5 kpc) in x,y,z. The size of the box is cho- 
sen so as to emphasize the bar region; outside its bound- 
aries the parametric model is left unchanged. This leads to 
a discontinuity in the density at the grid boundary; for ex- 
ample, averaged over the high-density region \z\ < 450 pc 
around the grid boundary, the rms discontinuity is < 1%. 
The likelihood maximisation is done using a conjugate gra- 
dient method. 

The likelihood function C maximized by the algorithm 

is 



L[ln(p)] = 



4 



2 

XSB 



(2) 



X- D'[ln{p)] + o 



2 , 

Xs + s ■ 



2 N 

Xspiral/ 



where the individual penalty terms are now described in 
more detail, and A, o, s are the penalty parameters. 

1. Surface brighntess term: 
xIb= Y1 [lIlp(m,n)]-SB(m,n)]^ (3) 

all SB(m,n) 

where Virrijn) is the projection of the density p along 



the LOS at the sky position {lm,bn) of the corresponding 
COBE/DIRBE surface brightness data point SB(m, n) (nat- 
ural log of flux in MJy/sr). Only the parts of the LOS that 
are in the model grid are taken into account in the projec- 
tion. We therefore rescale the observed surface brightness 
for each LOS by multiplying it with the ratio of the surface 
density in the box to the total surface density, calculated 
for the initial parametric model. SB(m,n) denotes this box- 
corrected surface brightness. Outlyers (data points with a 
very large distance to the projection of the initial density 
po- [In ['P(m,n)] — SB(m,n)]'^ > 10) are ignored in the sum 
eq. d). 
2. Smoothness: 

D^[\n{p)] = J2 "^"^^ 

al3^xx,yy,zz,xy,xz,yz 
Interior points(i,j',fc) 

This penalty term encourages smoothness of the density 
distribution by minimizing the total second derivative. All 
partial second derivatives are taken into account, and are 
symbolized by their "coordinate direction" af3. For example 
a/3 — xy stands for gfg^. All second derivatives are evalu- 
ated only at interior grid points not on any boundary of the 
box. Because the stepsize Sz in the density grid is smaller 
than the stepsizes Sx = Sy, the six "coordinate direction" - 
terms are given weights Wa/s. These are Wxx—Wyy—Wxy— 1, 

Wxz = Wyz = (^Y') ^^'^ ^^^~{t') ■ "^h® functions Dafi are 
first order approximations for the second derivatives along 
the "coordinate direction" af3; for example 

Dxx = In (pi+i j,fc) — 2 • In {pi,j,k) 
+ ln((pi_ij,fc) , 

Dxy = [In (pi+ij+i,fc) — In (p,;_i,j+i,fe)] 
— [In (pi+ij_i,fc) — In (pi_ij_i,fc)] . 

We expect significant differences in the expected second 
derivatives between different parts of the grid. For exam- 
ple high rates of change of the density are expected at 
small galactocentric radii. Therefore we give each grid point 
and "coordinate direction" a/3 additional individual weights 
w"^_.^y We have tried two different approaches for these 
individual weights. Using the density p of the initial non- 
parametric model, we have used 



XX 



and 

xy 



ln(p) 



4- 



ln(p). 



/ 1^ ■ [ln(p),+ i,j,fc -ln(p),_i,^. J 
• [ln(p),,,+i,,-ln(p),,^_,,j]|. 

or, in the second approach, w'^fj f.^ ~ D^p [ln(p)]. We have 
found no significant differences between models based the 
two approaches. Therefore we do not expect a significant 
infiuence of the exact definition of these weights on our re- 
sults. 



© 0000 RAS, MNRAS 000, 000-000 



4 N. Bissantz and 0. Gerhard 



We have also tried a smoothing term defined on a cylindrical 
grid, using p(r, 0, z) and a smoothness penalty term 



E 



E 



{ 



af3£rr,(l)4>tZZ Interior points(i ,j,fc) 



Here G a/sir) = 1 for a/3 = rr A a/3 = zz and G^^{r) = 
r/vmax- In tests comparing the two different smoothing 
penalty terms we have found that the cartesian smooth- 
ing needs somewhat better initial models to give good fi- 
nal results, while the cylindrical smoothing introduces some 
bias towards round models. However, the main results de- 
scribed in the sections to follow have been checked by doing 
the calculations with both approaches and were found to be 
identical. Our results therefore do not depend on the pre- 
cise smoothing approach and in the following, the cartesian 
smoothing will generally be used. 

To close the discussion of the smoothness penalty term we 
remark on a technical detail. The algorithm to maximize 
the likelihood function evaluates the gradient 



ac 



S(ln(Pijfc)) ' 

This gradient is modified slightly in the case of a cartesian 
smoothness penalty term: terms that couple a point to its 
neighbours of second order (i.e., not their nearest neigh- 
bours) are then omitted. We find that without this change 
the isodensity contours in the outer parts of the final models 
become rectangular because the cartesian smoothness term 
favours straight contours parallel to the coordinate axes. 

3. Eight-fold symmetry: Triaxial symmetry with respect to 
three principal planes of the bar (see Fig. |l|) is an essential re- 
quirement for being able to obtain a throe-dimensional lumi- 
nosity distribution from the COB E / DI RBE surface bright- 
ness map (Binney & Gerhard (L996)). Bars in external 



galaxies are observed to be approximately but not strict ly 
eight-fold symmetric (e.g., Sellwood & Wilkinson (1993)); 



In our deprojection we therefore aim to find a luminosity 
distribution that is as nearly eight-fold symmetric as is com- 
patible with the data and the smoothness constraint. This is 
done by discouraging deviations from eight-fold symmetry 
through the penalty term 



xl 



In (pi- 



i,j,k pairs 



Here the inner sum is taken over all distinct pairs of grid 
points constructed from the eight mirror-symmetric points 
of grid point {i,j,k), that should have identical luminosity 
density if the distribution were fully eight-fold symmetric. 

4. Spiral structure term: Generally there is not enough in- 
formation in the COBE/DIRBE surface brightness data to 
determine the luminosity distribution in the Galactic spiral 
arms. Essentially the only information about the spiral arms 
in these data is an enhanced surface brightness in th e arm 
tangent directions (see also Drimmel & Spergel (2001)). De- 
riving a sensible model therefore requires using additional, 
external information on the arm pattern. For most of our 
models we assume that the Galactic spiral arms in the NIR 
are described approximately by the four-armed pattern that 
seems to be most consistent with observations of HIT regions, 
gas, y oung stars, and NIR starcounts (see Ortiz & Lepi ne 
(1993) and the summary in Englmaier & Gerhard (199£)). 
We leave open the question whether the old population of 
the Galactic disk follows this four-armed or rather a two- 



armed pattern. In practice, we discourage deviations from 
the disk part of the initial parametric model p that is also 
used to start the iterations , wh ich includes the Ortiz-Lepine 
spiral arm model (Section |2.l[). The penalty term is: 



2 

Xspiral 



with 



E E[™w 



(4) 



k: I z(k) I <2s i.j 



+ ln(pij,fc) - ln(pij,fc)] 



In 



E 



fe ,all i,j 



fc.all i,j 



The outer sum in eq. (j^ is computed over all planes paral- 
lel to the main plane of the MW with indices k for which 
\z{k)\ < Zs with Zs =300 pc, and the inner sum is taken over 
all points within the current plane. I.e., the model is only 
biased towards the initial model near the disk plane. The 
weights ■w{k) guarantee that the model is encouraged to re- 
semble p only in shape, but not in normalisation. In fact, the 
normalisation ratios w{k) are usually somewhat different for 
every iteration step. We have tested that restricting the spi- 
ral structure penalty term to i? > 3.5 kpc did not change 
our models significantly. 

2.3 Optimal penalty parameters from test models 

Having specified the likelihood function we now need 
to determine the penalty parameters A, o, s which set the 
relative importance of the different penalty function terms. 
These can be found approximately using known test models 
by requiring that all terms in the penalty function should 
be of the same order of magnitude. Otherwise one of the im- 
posed constraints would be given too high or too low weight 
in the resulting model. Because non-parametric models do 
differ from the test models employed in this determination, 
we found it necessary to vary the resulting penalty function 
parameters within an order of magnitude or so, based on a 
(subjective) by-eye assessment of the final model. 

The special properties of the spiral arm bias term re- 
quire an additional modification of this simple scheme. The 
spiral structure is confined to the vicinity of the Galactic 
plane, and there it is neither very smooth on the scale of a 
few grid cells nor is it eight-fold symmetric. On the other 
hand, it penalizes deviations from some model p, which has 
a similar effect like a regularization term. After some exper- 
imentation we found that a good solution is to change the 
eight-fold symmetry parameter to 20% of its original value 
in the main plane of the MW and let it linearly rise with 
distance \z\ from the plane to its overall value. Similarly the 
smoothness penalty parameter is set to only 1 percent of 
its maximum value in the main plane, but rises very fast 
oc (1 — (1 — jzj/1.5 kpc)^") with distance from the central 
plane. 

For part of the problem (data, smoothness, and eight- 
fold symmetry) we have tested the above choice of penalty 
parameters with a second approach. We selected two para- 
metric models without spiral structure, that differed by 
about the same amount that we expect our initial mod- 
els in deprojections of the COBE/DIRBE L-band data to 
differ from the "true" model. From one of these models we 
generated artificial data by projecting it onto the sky and 
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adding Gaussian noise with a = 0.076™''^. This is the re- 
maining rms NIR colour variation found by Spergel et al. 
( [L996| ) for this dereddened COBE/DIRBE L-band data set. 
We then deprojected these data using the second parametric 
model as initial model and repeated the deprojections for a 
grid of points in (A, o) space. For all deprojected models we 
computed the rms difference between data and projection 
of the model onto the sky, and the rms difference in the 
natural logarithm between the "true" model and the depro- 
jected model on the density grid. For computational reasons 
we could do this only for a coarser density grid. The result- 
ing optimal penalty parameters A, o, when rescaled to the 
original grid, agree with the values obtained by the "equal 
penalty terms" method to within 1-2 orders of magnitude. 



2.4 Performance and limitation of the algorithm 

We have tested this algorithm with noisy artificial data 
and initial conditions derived from a variety of test mod- 
els. Quantitative results will be given in the next section, 
which studies ambiguituies in the deprojection of Galactic 
bar and disk models from surface brightness data, under the 
assumptions made. Here we discuss only a few qualitative 
points. 

It is clear that information about the "true" model used 
to generate the artificial data is increasingly hard to recover 
as the noise level approaches the magnitude of the signal 
that differentiates between different models on the sky. On 
the other hand, we have found that some noise is helpful as 
a "catalyst" to induce changes in the model. 

The initial models given to the algorithm differed from 
the "true" model by various amounts. We find that the ini- 
tial model must not be too far from the "true" model. This 
is hard to quantify by a distance criterion. However, the ef- 
fects that occur if the initial model is not suitable are easily 
visible in cuts through the density grid, and it is therefore 
possible to reject such models. 

If the initial model is suitable, convergence to a lumi- 
nosity distribution that fits the surface brightness map un- 
der the assumed smoothness, symmetry and spiral structure 
constraints typically takes 20-30 iterations. Otherwise the 
progress of the iterations becomes very slow at some point 
and the model may be caught near something like a sec- 
ondary minimum. In such cases, an iteration step that would 
be required to improve the surface brightness fit is often not 
undertaken because it would move the model too far away 
from eight-fold symmetry. Some secondary minima of the 
likelihood function correspond to nearly perfectly eight-fold 
symmetric models which have, however, physically unrea- 
sonable density distributions. The probability that the algo- 
rithm ends up with such a model increases with the distance 
between the initial model and the "true" model. 

Introducing a smoothness term in a complicated fit- 
ting problem often lessens the importance of such secondary 
minima. Here the additional problem is that a third con- 
straint, eight-fold symmetry, must be introduced to restrict 
the range of possible solutions (for fixed bar angle, the re- 
quirement of eight-fold symmetry restricts the solution to a 
small subset of the very large set of luminosity distributions 
which all project to the same surface brightness distribu- 
tion, Binney & Gerhard (1996)). It is easy to see then that 



secondary minima based on a balance between data and 
symmetry terms can appear in spite of the smoothing. 

If the requirement of eight-fold symmetry is imposed 
only weakly, a characteristic artifact appears in many models 
which we have termed Finger-to-Sun (FTS) effect. This con- 
sists of excess luminosity features in the nearby disk point- 
ing towards the observer. These arise because the depro- 
jection algorithm preferentially changes grid cells near the 
observer. The reason for this is that grid cells near the ob- 
server appear larger on the sky, and therefore contribute to 
many more surface brightness pixels than distant grid cells 
do. Consequently, for a model that is off from the "true" 
model underlying the data by a fixed fraction of the density 
in all grid cells, the total gradient ^xIb/^^ (In Pijfc) is much 
larger for grid cells near the observer (the actual value con- 
tributed to the surface brightness of a given pixel is inde- 
pendent of the line-of-sight distance of the contributing grid 
cells). Therefore, without smoothing and symmetry penalty 
terms, the luminosity model would be changed mainly near 
the observer, resulting in the described FTS efi'ects. 



3 HOW WELL-DETERMINED IS THE 
DEPROJECTION OF THE BAR? 

As a test application of our algorithm, we investigate in this 
section possible degeneracies in the deprojection of a bar 
model for fixed bar angle, keeping <^bar ~ 20deg through- 
out this section. More precisely, we ask what is the range of 
bar luminosity distributions that is compatible with given 
surface brightness data similar in quality to the COBE data 
which we will use in §5 for investigating the luminosity struc- 
ture of the inner Galaxy. We find that there exists a sequence 
of parametric models with different bar elongations in the 
Galactic plane and correspondingly different density con- 
centrations, that look very similar on the sky. Then we use 
the non-parametric algorithm to estimate the width of the 
"valley" of acceptable models around the sequence of degen- 
erate parametric models. Finally, we show that observations 
such as the apparent mag nitude distributions of clump gi- 
ant stars by Stanek et al. (1997) contain sufficient distance 
information to break this degeneracy, which incidentally is 
different from the well-known degeneracy in th e bar angle 
V3bar (Binney, Gerhard & Spergel ( |l997| ), Zhao ( |200C| )). 

We first generate artificial data by projecting a para- 
metric model onto the sky. This model (denoted sip) is 
defined by eq. (jl|), with bar parameters r/ — 0.5, = 
0.6, am = 1.8 and disk parameters zq — 208pc, i?d = 
2.44 kpc, and does not have spiral arms. We add Gaussian 
noise to these data, similar in amplitude to that expected 
for the COBE/DIRBE L-band data used later, for which 

for a 



asB = 0.076" (Spergel et al. (1996), see Section 4.1 



discussion). The parametric model is a best-fit model for the 
COBE/DIRBE L-band data for v?bar = 20deg. 

Next we fit other parametric models to these model sur- 
face brightness data. In these models we hold fixed the bar 
elongation rj. To decide whether a model is a valid match 
to the data, we define two criteria: (i) The average square 
deviations of the projected model from the artificial data in 
magnitudes (hereafter model RMS) must not be worse by 
more than 20% in the bar region |Z| < 36degA|6| < lldeg 
than for the "true" model, (ii) There must not be major 
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Figure 2. Two parametric models on the degenerate sequence for </3bar = 20deg. Note how similar models sip (upper panel) and s2p 
(lower panel) appear to the observer. 



parts of this region where a systematic error larger than the 
approximate average statistical error occurs in the residual 
map. 

With these criteria, we find a sequence of parametric 
models that are indistuingishible on the sky, i.e., for which 
systematic deviations between model and data are smaller 
than the noise in the data. Figure ^compares model sip with 
another parametric model s2p, which is on this sequence. 
Model s2p has bar parameters rj = 0.603, ( = 0.68, am = 
1.5 kpc, significantly different from model sip, but looks very 
similar on the sky. Models on this sequence are characterized 
by a degeneracy between the input bar elongation in the 
ly-plane, rj (see eq. |l]) , and the central concentration of the 
model. This is parametrized as the half mass radius ro.s, 

defined as the elliptical radius ro.s = \J + ^ + ^ which 

contains half of the mass of the bar/bulge inside an elliptical 
radius of 3.5 kpc. 

In Fig. ^ the sequence of parametric models is depicted 
in the ro.s — ?7 plane as the filled hexagons connected by 
the full line. The parametric model at the lower-right end of 
this sequence is just still a valid model as defined above. For 
models with even smaller "q the deviations from the artificial 
data rise rapidly. Fig. ^ shows the model RMS on the sky, for 
parametric models both on the sequence and on its exten- 
sions to higher and lower r], where the latter fail to pass our 
criterion for a valid model. On the high-r; extension of the 
sequence, the model RMS rises fairly slowly, but these mod- 
els have regions with too large systematic deviations from 
the projected data, which increase with increasing 77. From 



Fig. |3| parametric models thus have an uncertainty in their 
model parameters of ±0.1 in r] and ±20% in ro.s for given 
data on the sky. 

We now consider non-parametric models obtained with 
the deprojection algorithm described in First, we start 
the algorithm from initial models on the parametric se- 
quence, resulting in models si and s2 from the initial sip 
and s2p. For the non-parametric models we estimate r] by 
measuring the elongations of the surface density contours of 
the bar, determined from a projection of the model density 
along the 2-axis, but excluding the region \z\ < 225 pc to 
reduce the impact of the strong axisymmetric disk. This is a 
good approximation because the bulges in these models are 
near-ellipsoidal. We measure the half-mass radius ro.s from 
the density distribution of the bar/bulge only, obtained by 
subtracting the disk density of the parametric initial model 
given to the algorithm, from the density distribution of the 
final non-parametric model. We have checked by visual in- 
spection that the resulting bulge densities are reasonable. 
Fig. ^ shows that models sip and s2p lie very close to si 
and s2. The similarity of initial and final models is also ob- 
vious from a comparison of cuts through the densities. We 
conclude from this exercise that the deprojection algorithm 
does not introduce any significant bias in the final model, 
e.g., in the resulting value of 77. 

We will next discuss non-parametric models started 
from parametric models off the degenerate sequence, in or- 
der to investigate how broad the valley of acceptable models 
surrounding the sequence is. Several such models (s3-sl0) 
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Figure 3. Parametric and non-parametric luminosity models for 
surface brightness data obtained from projecting model sip un- 
der i/5bar = 20deg and adding noise. All models are plotted in 
a plane according to their half-mass radius ro.s and bar elonga- 
tion r). The full line delineates the sequence of degenerate para- 
metric models which project to indistinguishable SB data. These 
models are indicated by the filled hexagons along this line. The 
dashed lines connect parametric models (stars) on the extensions 
of this sequence towards high and low rj. These models are not ac- 
ceptable, because they either show too large systematic devations 
from the data, or have quality grade _F > 3 (see eq. Models 
sl-slO are obtained from non-parametric, iterative deprojections 
of the model SB data. Of these, sl,s2 were started from paramet- 
ric models (sip, s2p) along the sequence. Their proximity to the 
original (sip, s2p) demonstrates the absence of significant bias 
in the algorithm. The other non-parametric models were started 
off the sequence (stars on one end of the short lines denote the 
initial configurations) and are separated in the figure into accept- 
able final models (filled hexagons, F < 3), marginally acceptable 
models (3 < -F < 4, large "X"), and inacceptable models (F > 4, 
small "x"). 



are shown in the ro.s — r;-plane of Fig. g|, as stars for the 
respective initial parametric models, and as hexagons and 
crosses for the final non-parametric density distributions af- 
ter around 50 iterations. Clearly the algorithm evolves these 
models to the vicinity of the parametric sequence. Whether 
these non-parametric models are acceptable cannot be de- 
cided only on the basis of the model RMS on the sky, how- 
ever. For in a non-parametric model, substantial grid cell 
to grid cell noise can be introduced in order to improve the 
match to the data, which beyond a certain point is clearly 
unphysical. Therefore some measure of smoothness must be 
introduced in judging a model's validity. 

We measure the smoothness S of some model M as the 
sum of the absolute differences in logarithmic density, be- 
tween M and a smoothed version of M. S will be small for 
smooth M and large for noisy M. In determining S we sum 
only over grid cells with \z\ < 750 pc, to avoid contribu- 
tions from fluctuations in regions of the density grid where 
the density is very small. We smooth a model as follows, 
working with logarithmic model densities: (i) We resample 
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Figure 4. RMS SB residuals, xSBi in magnitudes, for parametric 
models on the degenerate sequence, plotted as a function of in- 
plane axial ratio rj (equivalent to varying ro.s). Diamonds indicate 
models on the sequence, stars models on the extensions of the 
sequence. 



each z-plane of the model o n a cyli ndrical grid of 30 lin- 
early spaced points in r = ^ ^ip- out to r = 7kpc, and 
60 points in azimuth 0. (ii) We smooth the model over 5 
points, first in the ^-coordinate, then in r, and finally in z, 
using second orde r polynomials (Savitzky-Golay filters, see 
Press et al. ( |l99^ )). In this way azimuthal gradients in the 
central parts of the model are not smoothed away, (iii) We 
re-interpolate to the original cartesian grid, (iv) In models 
with spiral arms, this procedure must be modified because 
the spiral arms imply density changes on small scales. For ex- 
ample, on a circle at galactocentric radius 5 kpc, the distance 
between adjacent points in in our smoothing algorithm is 
~ 500 pc; therefore the smoothing length is of the same or- 
der as the spiral arm FWHM. Thus before actually smooth- 
ing the model, we subtract from the density at every grid 
point the density of the spiral arm contribution in the initial 
parametric model, rescaled in each a;i/-plane separately. The 
rescaling factor for each a;i/-plane is determined by requiring 
that in this plane the rescaled mass of the initial model is the 
same as that of the (non-parametric) model that we actu- 
ally smooth. Having subtracted the rescaled spiral model, we 
smooth the remaining luminosity distribution as in (ii) , and 
then add the subtracted spiral density back to the smoothed 
density. This spiral arm preservation procedure is restricted 
to model planes with |z| < 450 pc, because the initial spiral 
models only extend to this height. The procedure ensures 
that the spiral arm component does not contribute to the 
final difference between M and the smoothed version of M, 
i.e., to S. 

Figure |^ shows the final non-parametric models con- 
sidered in this section in a plane of model RMS xsb and 
smoothness S. The best non-parametric models have S ~ 
4000, parametric models have typically 5* ~ 3000, and mod- 
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Figure 5. Non-parametric luminosity models s3-slO obtained 
from model SB data with the algorithm described in Section U 
The parametric models sip from which the data were generated, 
and s2p on the degenerate sequence, are also shown. The mod- 
els are characterized by their smoothness S (smaller S means 
smoother), and by their rms difference XSB with respect to the 
SB data on the sky, in magnitudes. Most models have been cal- 
culated with three values of the parameter A; the last symbol 
"a","b" or "c" in the non-parametric model name (for example 
s4a) indicates A, decreasing from "a" to "c" in multiplicative steps 
of 10. Model names printed in bold denote acceptable models, i.e., 
have quality grade _F < 3 (see eq. |5|) . 



els with S > 7000 are not smooth enough to be acceptable. 
We illustrate this in Figure ^, which shows cuts through 
model s4c (which has S ~ 9400) at z = 75 pc and model 
slOb (with S ^ 5400), also at z = 75 pc. 

We can now define a criterion to decide whether a non- 
parametric model is an acceptable representation of the SB 
data. To this end we introduce the quality grade 



F : 



V 0.076™/ \SoJ 



(5) 



where 5*0 is a measure of S for the "best" non-parametric 
models we find. We use the average So = 4827 of the three 
non-parametric models s9(a), s9(b), slO(a) printed in bold 
on Fig. ^ F is smaller for models that fit the data better 
and that are smoother. The maximum value of F for an 
acceptable model is somewhat subjective; we decided for 
F < 3. This resuhs in xsb < 0.11" for a model with 5 ~ So 
and 5* < 6800 for a model with xsb ~ 0.076™ . Models which 
violate the last criterion are not smooth enough to be viable 
(see Figure ^ . 

We have used the _F-criterium to separate the final non- 
parametric models s3-sl0 in Fig. ^ into valid and non-valid 
models. Filled hexagons indicate acceptable models {F < 3), 
large "X" marginally acceptable models with 3 < F < 4, and 
small "x" unacceptable models with F > 4 (of the three 
models computed with different A, that with lowest F is 



always shown). The hexagons thus mark the width of the 
valley of acceptable models associated with the sequence. 
This width, set by the model RMS and smoothness, trans- 
lates to an uncertainty in the structural parameters of a 
non-parametric model of ~ ±0.1 kpc in the half-mass radius 
ro.5 and « ±0.05 in the bar elongation 77. These are smaller 
than the uncertainties due to the existence of the degenerate 
sequence itself. 

Having quantified the uncertainties due to the extent 
and width of the degenerate sequence, we now show that 
these uncertainties can be much reduced if addtional dis- 
tance information is used, such as is available in the ap- 
parent magn itu de dis tributions of clump giant stars. Stanek 
et al. (( 1994 ), (1997)) have observed clump giant distribu- 
tions in a number of fields towards the bulge. In Fig. ^ we 
plot such distributions for three models on the parametric 
sequence and one at the edge of the valley of acceptable 
models around it. The relevant quantity is pr^, which has 
one extra power of r over that from the volume effect, due 
to the conversion of distance to magnitudes. The predicted 
distributions are different for the models on the sequence. 
Depending on the errors in the observations, it is thus pos- 
sible to discriminate between these models through their 
clump giant distributions. The clump giant distributions of 
models s3p and sip are very similar; however, these models 
can be discriminated on the basis of their projected surface 
brightness maps, using e.g., the F-criterium. Model s3p is 
not an acceptable model for the data generated from sip. 
The experiments conducted in this section make it likely 
that using the goodness-of-fit F and the clump giant con- 
straints together breaks the degeneracy in bar/bulge models 
that exists for fixed (^bar- 



4 DEPROJECTION OF THE INNER GALAXY 

In this section we apply the algorithm to the COBE/DIRBE 
data, using the d ust-co rrected L-band map of Spergel, 
Malhotra & Blitz ( [l996| ). These authors took the 240/im 
COBE/DIRBE map as a tracer for the distribution of dust 
to correct the COBE/DIRBE near infrared (NIR) J,K,L 
and M-band data for dust absorption. They simultaneously 
fitted parametric models for the dust and stars and with 
these models computed dereddened NIR surface brightness 
maps. The K-band emission near A ~ is dominated 

by starlight and only moderately affected by dust. In the 
L-band, near A~3.5/im, emission by hot dust and interstel- 
lar gas may be slightly more important, but dust extinction 
is reduced by about a factor of two in magnitudes as com- 
pared to the K-band. Because in some inner Galaxy regions 
extinction is significant even in the K-band, we have decided 
to use the L-band data in this paper. 



After dereddening Spergel et al. (1996) found a mean 
dispersion a « 0.076™ in colour between the K- and L-band 
maps. If we could assume identical Gaussian noise in both 
maps, 0.076"" /-\/2 would be a straightforward value to use 
for the SB error in the L-band, ass- However, the dominant 
sources of noise are probably systematic errors in the dust 
correction, correlated over several pixels and between the 
NIR maps, especially near the galactic equator. In this case 
the true errors in the data would be larger. We therefore 
take a more conservative approach and use ass ~ 0.076™ . In 
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Figure 6. A comparison of cuts through two models with significantly different smoothness 5 at 2: = 75pc. Left: Model s4c (S ~ 9400) 
Right: Model slOb (with S ~ 5400). We consider model s4c not smooth enough to be acceptable. 



the non-parametric deprojections of the NIR data described 
below we have therefore tailored the smoothness penalty 
function parameters such that we get models with RMS of 
this order. 

The models that we obtain from deprojecting the 
COBE/DIRBE L-band data will then be verified a posteriori 
by comparing with the apparent magnitude di stribu ti ons fo r 
clump giant stars, measured by Stanek et al. ((1994), ( L997)) 
along certain lines-of-sight towards the Galactic bulge. 
Clump giants have nearly identical absolute magnitudes 
with a small dispersion of ~0.2-0.3'", and it is therefore pos- 
sible to derive their distance distribution (in a statitistical 
sense) from their observed magnitude distribution. Stanek 
et al. (1997) analysed colour magnitude diagrams (CMDs) 
in several OGLE fields, including Baade's window and two 
nearly symmetric fields at {l,b) ~ (— 4.9 deg, — 3.4 deg) and 
(5.5 deg, — 3.4 deg). They determined extinction-insensitive 
magnitudes Vv-i ~V — 2.6-(V — I), and plotted histograms 
of the number of stars as a function of magnitude using 
AW-/ = 0.05^-bins for the stars in the part of the CMD 
dominated by bulge red clump stars. The red clump distri- 
butions along these LOS peak at d ifferen t distances; using a 
bootstrap technique Stanek et al. (1994) determined a rela- 
tive distance modulus of 0.37 ± 0.025™ between the LOS at 
Z~5deg and Z«— 5deg, and 0.15 ±0.02™ between Baade's 
window and the field at I = —4.9 deg. These asymmetries 
provide independent evidence for a non-axisymmetric lumi- 
nosity distribution in the inner few kpc of the MW, but 
will be used here to check the three-dimensional luminosity 
distribution of our models for the L-band flux data. 

In §4.1 and §4.2 we deproject the data with and with- 
out inclusion of spiral structure in the model, using a bar 
angle </5bar ~ 20 deg in both cases. This will demonstrate that 
inclusion of spiral structure leads to a better model for the 



COBE/DIRBE L-band data an d re sults in a more elongated 
shape for the barred bulge. In 



4.3 



we constrain the accept- 
able range of bar angle from a set of models with different 
Vbar together with the clump giant data. 

In all cases, the non-parametric density estimation pro- 
cedure begins with fitting a parametric model to the data. 
This is used in the non-parametric deprojection in three 
ways; (i) As starting model of the iterations, (ii) to cor- 
rect for the limited size of the model density grid and (iii) 
in models that include spiral structure, to define the spiral 
structure penalty function term. 



4.1 Models with bar but without spiral arms 

To find a model for the COBE/DIRBE L-band data with- 
out spiral arms, we start the iterations from th e par amet- 
ric model given by Binney, Gerhard & Spergel (1997), and 



set the spiral structure penalty term in the likelihood func- 
tion to zero. Fig. ^ shows the surface brightness map of this 
non-parametric model compared with the COBE/DIRBE L- 
band data. It fits the COBE data well; the iterations were 
stopped when the RMS of the model reached 0.073™. On 
the left side of Fig. |^ we show two cuts through this model, 
in the upper panel a cut in the a;?/- (main) plane, and in the 
lower panel a cut parallel to the iz-plane, at i/~85pc. In 
the xy map we can see overdensities that point from the ob- 
server (at a;«7.5kpc and i/~2.7kpc) towards the tangential 
points of the spiral arms, at approximately I = ± 30 deg and 
/ = —50 deg (see also Drimmel & Spergel ( |2001 )). 

Can we find a model of the MW without spiral arms 
which fits the COBE/DIRBE L-band data as well as this 
model, and does not have such features? To try to eliminate 
these overdensities, we have computed models with larger 
penalty parameters for deviations from eight-fold symmetry 
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Figure 7. Predicted number of clump giant stars in three fields observed by Stanek et al. (1997), versus relative distance modulus in 
magnitudes, for parametric models sip (upper left), s2p (upper right), s3p (lower left), and the parametric model with bar elongation 0.41 
also along the degenerate sequence in Fig. H (lower right). From the differences in height and location of the peaks in these distributions, 
it is possible to discriminate between models on the sequence. However, a comparison of the two panels on the left shows that some 
models off the sequence can mimic the clump giant distributions of models on the sequence. Such models have to be discriminated by 
their goodness-of-fit for the surface brightness data. 



and/or smoothness. However, we have only found models 
which either contain similar features in the directions to the 
spiral arm tangent points and achieve a "good" model RMS 
Ri 0.07"', or models which are smooth and nearly eight- fold 
symmetric without such features, but which then fit the data 
badly (model RMS > 0.2™). Smooth models which fit the 
data well without these features do not seem to exist. This 
suggests that a spiral arm component is implied by the data. 



I.e., the luminosity in these features is real, but the shift 
towards the observer is due to the FTS effect discussed in 



§2.4. 



We mention that a sir nilar effect was observed by Bin- 
ney, Gerhard & Spergel ( 1997 ) in their Richardson-Lucy 
(RL) models. They used the same data of Spergel et al., 
and in their models found symmetric density enhancements 
at 2 — 3 kpc down the minor axis of the bar. They suggested 
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Figure 8. Surface brightness maps of the model without spiral arms (upper panel), and our reference model 20A including spiral arms 
(lower panel). Full contours show the model surface brightness, dashed contours the COBE/DIRBE L-band data. Contour levels are in 
magnitudes with some arbitrary offset, common to both panels. Both surface brightness maps are very similar and fit the COBE/DIRBE 
L-band data with very similar x^- The underlying models are non-parametric on a grid of 5x5x1.5 kpc^ and are continued by the initial 
parametric models outside of this grid for the projection onto the sky. 



that these features might be caused by spiral arms being 
symmetrized by the RL algorithm, which forces the models 
to be eight-fold symmetric. In contrast to the RL models our 
deprojection algorithm favours changes to the model density 
near the observer, and so in the model discussed here the 
spiral arm overdensities are placed near the observer. The 
projected bar elongation in our model without spiral arms 
is 0.56 {\z\ > 225 pc), comp arabl e to that in the model of 
Binney, Gerhard & Spergel (|l997|). 



4.2 Models with bar and spiral arms 

We have seen that models for the MW L-band luminosity 
density develop overdensities near the observer, towards the 
tangential directions of Galactic spiral arms, when started 
from triaxially symmetric initial distributions. Thus we now 
proceed with a non-parametric density estimation of the 
data by a model which includes spiral arms. We show that 
the inclusion of spiral structure not only improves the model 
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Figure 9. Comparison of the models with and without spiral structure: in the main plane of the MW (upper panels), and in a plane 
parallel to the major and minor axes of the bar at the first grid point j/Ri85pc (lower panels). Both models are for bar angle 20deg. 
Contours are in logarithmic density (in CLC// kpc'^), and printed in the plot. Left: The model without spiral structure, obtained from 
a (parametric) triaxially symmetric initial bar model. This model shows deviations from eight-fold symmetry in the a;j/-map. These 
overdensities approximately point from the observer (at a;Ri7.5kpc and j;fs;2.7kpc) towards the tangential points of the spiral arms. 
Right: Our reference model 20 A including spiral arms. Both the (parametric) initial model and the penalty function in this non-parametric 
density estimation contain a spiral structure term. 



2.1): r? = 0.31, C = 0.38, 



pT= 1180 CLU, bar angle <^b; 



for the COBE/DIRBE L-band data, but also results in a bet- 
ter match to the LOS distributions of clump giants towards 
certain bulge fields. We also show that the derived structure 
of the bulge/bar does not depend significantly on whether 
the assumed spiral model is two or four-armed. 

First we fit a parametric model to the COBE/DIRBE 
L-band data, continuing to assume a bar angle <^bar = 
20 de g. This model has the following bulge parameters (see 

= 2.8 kpc, ao — 0.1 kpc, 
- 20 deg; disk parameters: 
Rd = 2.2 kpc, a = 0.65, zq = 0.19 kpc, zi = 0.042 kpc, 
Pd = 0.54 CLU/ kpc"^; and spiral arm amplitude: ds = 0.90, 
for the four-armed logar ithmi c spiral arm model similar to 
that of Ortiz & Lepine ( |l993| ). Here CLU are COBE lumi- 
nosity units as in Binney et al. (1997). 

Starting the algorithm from this configuration and in- 
cluding the spiral structure penalty term, we find a best non- 
parametric density model. This model was selected from a 
number of calculations run to fine-tune the penalty func- 
tion parameters, optimizing both the RMS and the model 
smoothness. This model, stopped at RMS ~ 0.079™, is one 
of our best if not the best model, and will be used as refer- 
ence model "20A" in what follows, deferrin g th e discussion 
of the acceptable range of <pbar to Section 4.3 below. The 



RMS for this model is similar to that for the model without 
spiral arms; the difference is not significant. Correspond- 



ingly, both models look very similar on the sky and match 
the data well; see Figure pi The main difference is in the 
three-dimensional structure: Figure ^ shows cuts through 
both models. In the model with spiral arms, these arms pro- 
vide most of the non-axisymmetric density. Some residual 
luminosity is still required towards some of the arm tangent 
points, but deviations from point-symmetry in the Galactic 
plane near the Sun {x > 3 kpc, \y\ < 5 kpc) are reduced by 
a factor of ~ 4.5 (NB: the spiral arms remain nearly point- 
symmetric during the iterations) . 

The quality of the model fit to the COBE/DIRBE L- 
band data, especially for the non-axisymmetric bar/bulge, 
is visualized by the asymmetry maps shown in Fig- 
ure |l^ These maps show the difference between the 
logarithmic fiuxes at positive and negative longitudes, 
SB(|/j, b) — SB(— b) for both our reference model 20A and 
the COBE/DIRBE L-band data. Generally, the model is 
smoother than the data, but it reproduces the main bar- 
related features of the observed map well. Note the good 
recovery of the regions with clear bar signature around 
{I, b) « (8 deg, ±5 deg), and the change of sign of the asym- 
metry near the galactic centre. Here the far side of the bar 
appears brighter, a signature of a ba r with its near end at 
positive longitudes (Blitz & Spergel (1991)). The most ob- 
vious differences between both maps are in a strong feature 
at (/,6) ~ (14 deg, deg) in the observed map, which may 
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Figure 10. Asymmetry maps for the COBE/DIRBE L-band 
data and for the reference spiral model. The COBE data were 
smoothed with a Savitzky-Golay filter (Press et al. 1994, setting 
their M = 5) to reduce noise. Contours show the asymmetry 
in magnitudes between positive and negative longitudes. Dashed 
contours indicate negative values. Positive values indicate that 
the MW is brighter at positive longitudes. Contour spacing is 
O.l'", and the bold contour is at 0"* (no left-right asymmetry). 
In the plot for the model a dotted contour is drawn additionally 
at -0.05™. 



be local, and in the contours near zero asymmetry, which 
are most affected by noise. 

Figure ^ shows the density of model 20 A projected 
along the z-axis for \z\ > 225 pc. The density near the main 
plane of the MW is excluded to avoid modifying the bar con- 
tours by the strong, nearly axially symmetric inner disk com- 
ponent. In model 20A the bar is more elongated than in the 
model without spiral arms. This is because for the relative 
geometry of the arms, the bar, and the position of the Sun, 
the spiral arms make the model appear broader in longitude 
on the sky, and for fixed observed asymmetry this allows the 
bar to be more elongated in the plane. The projection also 
stresses the true extent of the bar, which is ~ 3.5 kpc. The 
measured projected bar elongation in the xy-plane is ~10:3- 
4. The contrast in the total face-on surface density between 
{x,y) — (2.5 kpc, 0) and {x,y) = (0,2.5 kpc) is a factor of 
~ 1.6. A fit to the disk profile in the radial range between 
3.5 kpc and 5.5 kpc gives an exponential radial scale-length 
of 2.1 kpc. 

We now compare the three-dimensional struct ure o f 
the model to the observations of Stanek et al. ((1994), 



(1997)). These authors determined the line-of-sight distri- 



butions of clump giant stars for a number of fields towards 
the bulge/bar. Because these stars are of nearly identical ab- 
solute magnitude within a small dispersion, measuring their 
brightness distribution at a certain position on the sky pro- 




X [kpc] 



Figure 11. Projection of our model 20A onto the xj/-plane. To 
avoid modifying the bar contours by the strong, nearly axially 
symmetric disk component, only the density at \z\ > 225 pc was 
integrated. The length of the bar is ~ 3.5 kpc and its elongation is 
~10:3-4. Contours are in logarithmic surface density, with relative 
contour values indicated on the plot. 
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Figure 12. Line-of-sight distributions of clump giants in the di- 
rections of three fields observed by Stanek et al. (1994, 1997), 
with symbols as given in the figure legend. The abscissa is their 
Vv—i- For field M7 the observed counts were averaged over two 
CCD frames of equal angular size; for field M5 one CCD frame 
and for Baade's window six frames were used. The curves show 
our best-fit model including spiral arms. For the model, two con- 
stants were adjusted by eye as follows: (i) The model distribu- 
tions were shifted along the abscissa such that the I = 5.5 deg 
and / = —4.9 deg peaks match best the locations of the observed 
peaks, (ii) The normalisation of the model curves was determined 
such as to approximately match the normalisation of the observed 
distributions in the fields M5 (l ^ -4.9 deg) and M7 (I 5.5 deg). 
The resulting shift and normalisation are then applied to all 
model distributions simultaneously. All model distributions are 
convolved with an assumed width of the clump giant intrinsic 
luminosity distribution of 0.3™. 
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vides a profile of their density along the respective LOS. 
The apparent magnitude of the peak of the distribution lo- 
cates the highest-density point along the LOS. The differ- 
ence in the apparent magnitude of the peak between various 
bulge fields, especially between the fields at / « 5.5 deg and 
I ~ —4.9 deg, reflects the shape of the non-axisymmetric 
bar/bulge density distribution. These data therefore pro- 
vide an independent and strong test for the COBE/DIRBE 
density models. Note that the clump giant density maxima 
along the LOS are insensitive to a possible radial gradient 
in the ratio of clump giant stars to L-band luminosity. 

For the comparison we fold the model LOS density dis- 
tributions with a gaussian exp — (Amag^/2(T^), to simulate 
the intrinsic dispersion of clump giant absolute magnitudes. 
In the literature value s 0.1' " ^ cr < 0.3™ have bee n pro - 
posed (Stanek et al. ([i997|) , Stanek & Garnavich ( [l998| ) , 
Paczy nski & Stanek (1998)). See Fig. 3 of Ferryman et al. 
(1997) for an i mpress ion of the sharpness of the clump in 
V and Udalski (2000) for an analysis of the metallicity de- 
pendence on the mean I-band brightness. We have explored 
a number of different values for a between 0.2™ and 0.4™, 
and finally decided for a = 0.3™, because with this value 
our models reproduce best the observations. For each model 
we need to select two additional parameters, the normalisa- 
tion of the model density and a shift in magnitudes. These 
represent the (unknown) conversion factor between model 
density units and the number density of clump giant stars, 
and the absolute magnitudes of clump giants. For model 20A 
we determine these two parameters such that they fit best 
the observations at I « 5.5 deg (field M7) and I « —4.9 deg 
(field M5). 

Figure |l^ shows that the LOS distributions of model 
20A compare well with the clump giant observations of 
Stanek et al. Fitting Gaussians to the upper parts of the 
model curves yields an asymmetry of 0.44™ between the 
LOS at I = —4.9 deg and I = 5.5 deg, even somewhat larger 
than observed. Also the relative peak heights and approxi- 
mate widths of the model distributions agree with the data 
within ~ 10%. These are less of a constraint on the bulge 
shape, however, because they are influenced by other pa- 
rameters like the density concentration of the bulge and the 
clump width a. We remark that the choice of model normal- 
isation factor such that the main difference is in the peak 
height of Baade's window distribution is arbitrary; we could 
also have decided to make a near-perfect flt to Baade's win- 
dow distribution and a 10% error in the peak heights of 
the other two distributions. I.e., the model is slightly more 
centrally concentrated than the clump giant distribution. 

The measured asymmetry in the new model is signifi- 
cantly larger than in the eightf old-sy mmetric Richardson- 
Lucy models of Binney et al. (1997) and Bissantz et al. 
(1997). These models have a maximal asymmetry ~ 0.27™ , 
and generally < 0.2™, compared to the Stanek et al. (199'^) 
resuh of 0.37™ ± 0.025™. As Figure |l| shows, it is also sig- 
nificantly larger than the asy mmetry in the model without 
spiral arms from Section |4.l|, which is not a good fit to the 
clump giant data. In Fig. \13\ the magnitude scale for the 
different models has been chosen so that they all match the 
observed distribution in field M7. The smaller asymmetry 
of the model without spiral arms thus becomes apparent as 
deviations in the peak positions in BW and, in particular, 
field M5. The spiral model 20 A has a greater asymmetry in 



the peak positions for these fields because the elongation of 
its bar is larger, as discussed above. 

Fig. ^ also shows a model "20B" , which was obtained 
by deprojecting the COBE data with a modifled broad spi- 
ral arm model of FWHM 500 pc, and a model "20S", which 
is a smoothed version of the standard model with bar an- 
gle 20 deg. For the smoothing we have used the algorithm 
described in Section ^ in the form that preserves the spi- 
ral arms. Model 20S shows that small-scale structure in the 
luminosity model does not influence the model clump giant 
distributions in these bulge flelds signiflcantly. Model 20B is 
actually a better flt to the amplitudes of the observed dis- 
tributions than our reference model 20A. However, it does 



not fit the L-band data as well (see Section i.c below). 

So far we have considered luminosity models in which 
the spiral arm component, if present, has a four-armed struc- 
ture. T his is based mainly on observations of gas tracers (see 
Vallee ( |l995[ ) and Englmaier & Gerhard ( |l999| )). However, 
it is unclear whether the MW has two or four stellar spiral 
arms. In the L-band data the tangent point at I ~ 50 deg is 
not visible. This may point to a two-armed structure; how- 
ever, this tangent point is also weak in CO, possibly due 
to the geometry of the LOS through this arm (Dame, pri- 
vate communication). Drimmel & Spergel (2001) argue that 
the Sagittarius-Carina arm is - at least - weaker than the 
other arms. Therefore we now ask whether our results on 
the structure of the bar/bulge depend on the assumption 
of a four-armed spiral model. We have generated two non- 
parametric models of the COBE/DIRBE L-band data in 
which a two-armed parametric model was used both for the 
starting model and the spiral arm penalty term. The bar an- 
gle is still assumed to be 20 deg. In the first model, the arms 
start near the major axis of the bar, and the pitch angle is 
half that used in the four-armed model above. In the second, 
we omit the Sagittarius-Carina arm and its counter-arm, the 
arms start near the minor axis of the bar, and the pitch an- 
gle is the same as in the four-armed model. Compared with 
the four-armed model 20A, both two-armed models found 
by the algorithm show only minor differences. They fit the 
COBE/DIRBE L-band data equally well as model 20A; the 
asymmetry in the clump giant LOS distribution peak posi- 
tions differs by < 0.03™, the peak heights differ by < 12%, 
and the elongation of the bar/bulge differs by < 4%. It ap- 
pears therefore that the Scutum-Crux arm is most impor- 
tant for the deprojection of the bar. Thus the assumption 
of a four-armed spiral model does not significantly bias the 
results obtained for the structure of the bar/bulge. 

We end this section by a short summary of its main re- 
sults. The first is that inclusion of spiral structure improves 
the model of the COBE/DIRBE L-band data in the sense 
of removing unphysical structures in the disk plane. Sec- 
ond, in models including spiral structure the bar is more 
elongated as compared to triaxially symmetric models, and 
third, this more elongated bar gives a better representation 
of the observed apparent magnitude distributions of clump 
giant stars in several bulge fields. 



4.3 Constraining the bar angle 

In the previous section we have described model 20A for 
bar angle iphm = 20 deg as our reference model. We will now 
construct similar models for a variety of bar angles, and 
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Figure 13. Cn ni pa ri son of observed clump giant line-of-sight distributions with several models, all for i^bar = 20deg: the model without 
spiral arms of § [4.]| (full curve), our reference model 20A with spiral arms (dot-dashed), a similar model with broad spiral arms (20B, 
dotted), and a smoothed version of the reference model (20S, thick dot-dashed). 
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Figure 14. Smoothness parameter S and model RMS XSB for 
non-parametric models with spiral arms of the COBE/DIRBE L- 
band data, for bar angles 10 deg < i/3bar ^ 44 deg. Models in the 
lower left of the diagram both provide a good fit to the surface 
brightness and are the smoothest. Those printed in the diagram in 
bold face are acceptable models in the sense of F < 3; cf. Section 
a. Model 20B for bar angle 20 deg and with broad spiral arms is 
the only "marginally acceptable" model (3 < -F < 4). Thus bar 
angles 20 deg < </Jbar 1^ 25 deg are preferred. 



use them to constrain the possible range of ly^bar- This will 
also make clear why we selected model 20 A. We will com- 
pare these models in three different ways. In the first (and 
weakest) test we use the quality of fit of the COBE data 
for the best-fit parametric models used as starting models 
in the non-parametric deprojections. In the second test we 
compare the non-parametric models themselves, using the 
F-criterium of Section H which measures a combination of 



quality of fit to the data and model smoothness. Finally, the 
third test uses the distribution of clump giant apparen t mag - 
r iitude s along the LOS measured by Stanek et al. ((1994), 
( [l997| )). 

We have non-parametrically estimated the COBE data 
for bar angles 

<y3bar = 10 deg, 15 deg, 20 deg, 25 deg, 30 deg, 35 deg, 44 deg, 
using the standard four-armed spiral model, and addition- 
ally for iphai ~ 20 deg using a model with broad spiral arms 
of FWHM 500 pc. In each case we started the iterations from 
a corresponding parametric best-fit model. 

First we analyse these parametric initial models, and 
find that they fit best the part of the sky dominated by the 
central bulge/bar, around |Z| «8deg, fe«±5deg, when the 
bar angle is 20 deg < iphav < 30 deg. For other bar angles the 
models show systematic deviations from the COBE/DIRBE 
L-band data in these regions in {l,b), symmetric to the galac- 
tic equator. 

Second we compare the non-parametric models using 
the F criterion (goodness of fit combined with smoothness) , 
as introduced in Section Their smoothness parameters S 
and model RMS are shown in Figure ^ Models in the lower 
left corner give the best fit to the surface brightness and 
have the highest degree of smoothness. Acceptable models 
{F < 3) are the standard model 20A (v^bar = 20 deg), and 
the (^bar = 25 deg model. The i^bar = 20 deg model with 
broad spiral arms is marginally acceptable (3 < F < 4). 
The other models are not satisfactory: they are either too 
unsmooth or do not fit the SB data well. We illustrate the 
trade-off between goodness of fit and smoothness in these 
cases with two models for (/Sbar = 35 deg and tphar = 44 deg, 
obtained with different smoothness penalty parameters A. 
One of these is clearly not smooth, and the other is smooth 
but fits the COBE data poorly. We conclude that bar angles 
20 deg < V^bar 25 deg are preferred. 

Finally, we compare the predicted clump giant line- 
of-sight distrib ution s of the se models with observations by 
Stanek et al. (( [l994| ), ( |l997| )) in Figure |l|. The intrinsic dis- 
persion of clump giant absolute magnitudes is again set to 
a — 0.3™ (see Section 4.2). The remaining free parameters 



of the models, that is the normalization and the magnitude 



© 0000 RAS, MNRAS 000, 000-000 



16 N. Bissantz and O. Gerhard 



Baade's window 




Bar angle 10" 


Bar angle 25^ 


Bar angle 1 5" 


Bar angle 30^ 


Bar angle 20" ';20A) 


Bar an^gle 35" 




Bar angle 44" 










v^''-' t;>^>-c — ' 












//V / ^ 




-^.r ■■if / 








> / 




J* / 






























\ \ 




\ ' 




\ 




\ 



' 'm5 


1 ■ 1 


BoronQlslQ" 


Bar angle 2"^" 


Bar angle 1 5^ 


Bar angle ^O'^ 


■ Bar angle 20" (20A) 


Bar angle ^S"" 
Bar angle 44^^ 


^ '~~ ^ 






















/ / y /i/ 


\ \ \ \ X v 


- / / /// 










\ x. 




\ v 

\ 

\ 

\ 

\ 






TPS 




Bor angle 


10° 


Bar angle 25° 


Bar angle 


15° 


_ _^ Bar angle ^0° 


Bar angle 


20° (20A) 


*^ angle 35° 




~ -^Bar angle 44° 










J 
























J /' 






.''/ 






/ / 






N ■ . 5 






/ 


■■ * ■''^'^^ 


X X. ■■. 


// 




V \ 


// 




" - - ^ ^ >^ 


/ 




~ ^ \ 


/ 
















° ^ 














/. , // . 












// 












/ • 







Bar angle 25"^ 

Bar angle ^O'' 

Bar angle 35^ 

Bar angle 44"^ 




Figure 15. Clump giant distributions for models with different bar angles. For each model, the normalization and shift in the magnitude 
scale axe determined such that the model fits best the data in field M7. The best fit is found for bar angles V'bar 15 deg —30 deg. 
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shift, are fixed by optimizing the model fit to the observa- 
tions in field M7. These parameters are then identical for all 
fields. 

Fig. shows that several models fit the observed 
clump giant line-of-sight distributions nearly equally well. 
None of the models fits the observations in field Glf (at 
{l,b) ~ (8.2 deg, — 4.4 deg)), probably since these data are 
strongly influenced by the underlying broad population of 
stars (the power-law part in the luminosity function fit- 
ted by Stanek et al. ). The models with ifhar ~ fOdeg 
and (^bar = 44 deg provide inferior fits to the data. The 
Vbar = 10 deg model has wrong peak positions, heights, and 
widths for fields TPS (at {l,b) « (-0.1 deg, -8.0 deg)) and 
M5, and in Baade's window the peak width is too large. The 
<(5bar = 44 deg model shows wrong peak positions in Baade's 
window and field M5, and a deficit in asymmetry between 
Baade's window/field M5 and field M7 (for this model the fit 
can be improved slightly by using a very high intrisic disper- 
sion a > 0.4™. However, this is far above published values, 
see §4.2). The other models with 15 deg < ^bar ^ 30 deg 



cannot be distinguished on the basis of the present data. 

In summary, a bar angle 20 deg < VPbar < 25 deg is 
consistent with all three tests. The clump giant data are 
consistent with a wider range, 15 deg < ifihm < 30 deg; how- 
ever, for ((3bar ~ 15 deg and (pbar = 30 deg we have not been 
able to find models passing the F-criterion (goodness of fit 
combined with smoothness). The 20 deg model 20 A stands 
out by its smoothness. 



5 MICROLENSING 

We will now provide predictions for the microlensing optical 
depth for our NIR models. The required conversion factor 
from luminosity density to mass density can be found from 
fitting the observed terminal velocity curve with a model 
for the gas flow in the gravitational potential of the bar 
and disk, assuming spatially c onsta nt L-band mass-to-light 
r atio M /Ll (Bissantz et al. (1997), Englmaier & Gerhard 
(1999)). In a forthcoming paper (Bissantz, Englmaier & Ger- 
hard, in preparation) we will describe SPH simulations of 
the gasdynamics in the potentials of the new luminosity 
models presented in the present work. The value of M/ Ll 
derived from the terminal curve depends somewhat on the 
precise model parameters, for example, the pattern speed. 
For the best SPH model it is M/Ll = 3.9 x 10** Mq/ CLU, 
and between the various models it varies in the range 3.7- 
4.1 X IO^Mq/CLU, i.e., by ±5%. In the following we wiU use 
M/Ll = 3.9 X IQ^Mq/ CLU. 

With this value, the optical depth towards Baade's win- 
dow (BW) (/ = 1 deg, fe = — 3.9 deg) for our reference model 
20A is r_6 = 0.95 for the full sample of source stars, us- 
ing /3 = — 1 in the parametrization of Kiraga & Paczynski 



(1994) in accounting for a magnitude cut-off. For clump gi- 
ant sources only it is t-q = 1.39 in BW (/3 = 0). In Figure |l^ 
we present optical depth maps for both cases, predicted from 
model 20 A, over the entire inner Galaxy region. At constant 
mass normalisation, the range in luminosity density through 
BW predicted by models 15, 20B, 30 corresponds to an un- 
certainty in r of about 10 percent. 

The MACHO group has published revised values for 
the optical depth near Baade's window based on their new 



bulge microlensing data. They give an optical depth for all 
sources, based on 99 events from 3 yr of data, of r_6 = 
2.43^0 3g averaged over 8 fields ce ntred at I = 2.68 deg 
and h = -3.35 deg (Alcock et al. ( |2000a| )). From 52 mi- 
crolensing event s wit h clump giant sources in 5 yr of data, 
Popowski et al. (2000) give r_6 = 2.0±0.4 at a mean position 
I — 3.9 deg and b — —3.8 deg. The corresponding values pre- 
dicted from model 20A are r_6 = 1.10 at I = 2.68 deg and 
b — —3.35 deg (/3 = —1), and r_6 = 1-27 for clump giant 
sources at I = 3.9 deg and b — —3.8 deg (/3 = 0). Figure |l7| 
shows profiles of optical depth along galactic latitude at the 
mean longitudes of the MACHO observations. The curve for 
Z = 2.68 deg shows the optical depth for all sources, and that 
for 1 = 3.9 deg the optical depth for clump giant sources only, 
for comparison with the observational results. Both curves 
illustrate the steep dependence of t with Galactic latitude. 

The optical depth in the new NIR models is about 10% 
higher than f or the eight-fold symmetric RL models of Bis- 
santz et al. (1997). This near agreement between two in- 
dependent non-parametric models for the COBE/DIRBE 
data is gratifying. What difference there is mostly comes 
from the 10% higher luminosity to mass conversion (M/Ll) 
for the new model. The effects of the structural differences 
in the new model appear to compensate. On the one hand, 
there are more lens stars in the new model where the line-of- 
sight to Baade's window crosses the spiral arms, increasing 
r. On the other hand, because the total surface density along 
this LOS is approximately constant (since specified by the 
DIRBE SB), the density in the inner bulge is lower than 
in the models of Bissantz et al. (1997). This decreases the 
average distance to the sources, and hence r. 

Compared to the observed optical depths, the predic- 
tions of the new model are still low. For clump giant sources 
only, the model is cons istent with the preliminary new value 
(Popowski et al. (^000|)) r_6 = 2.0±0.4 to within l.Scr. One 



assumption we have made is that the microlensed source 
stars are distributed similar to the luminous matter in the 
Galaxy. This is confirmed by the good agreement of the 
clump giant distributions predicted fr om t he NIR model 
with those measured by Stanek et al. ((h994|), (|l997l)). Fur- 



ther evidence that the lensed stars do not contain a sig- 
nificant component far behind the Galactic center (e.g., 
in the Sagittarius dwarf) comes from the CMD in Fig. 2 
of Popowski et al. . With the distribution of source stars 
known, the predicted optical depth can only be modified 
significantly if the distribution of lenses is substantially dif- 
ferent from that of the sources, i.e., if mass does not follow 
NIR light. 

Associated uncertainties in the NIR model prediction 
were discussed by Bissantz et al. (1997). There appears 
to be two main causes for concern; (i) The corrections by 



Spergel et al. (1996) for dust absorption might conceivably 
have caused us to overestimate the luminosity in the Galac- 
tic plane. In this case, there could be room for some lensing 
dark mass in front of the bulge fields, (ii) Independently, the 
L-band mass-to-light ratio might vary with position in the 
inner Galaxy. Both would have the effect of modifying the 
mass distribution of the inner Galaxy but, as discussed by 
Bissantz et al., the effect of this on the optical depth is lim- 
ited to ~ 20% because of the constraints from the terminal 
velocity curve. Together with the 10% spread in the model 
optical depth discussed above, this implies an uncertainty 
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Figure 16. Microlensing optical depth map of our reference model including spiral arms. The model is for bar angle 20deg. The upper 
map shows the optical depth for all sources, the lower map for clump giant sources only. The mean positions of the newly published 
MACHO results are indicated in each map. 



in the predicted clump giant value of order 0.3, i.e., there is 
no strong discrepancy with the clump giant measurement of 
Popowski et al. . 

However, the high optical depth of r_6 = 2.43lQ'3g ^'-'^ 
all sources me asured from difference imaging analysis (DIA, 
Alcock et al. (2000c)) is 3.5cr away from the predicted value 
of model 20A (r_6 = 1.10), and even after allowing for a 30% 
uncertainty in the predicted optical depth is still more than 
2.5(T discrepant. From the measured optical depth, Alcock 
et al. deduced Z.2itofo for bulge sources only, assuming 
a 25% co ntribu tion from disk sources. Binney, Bissantz, & 
Gerhard (2000) have shown such high optical depths can- 



not plausibly be reconciled with the Galactic rotation curve 
and the mass density near the Sun. To underline their ar- 
gument, to increase the optical depth from r-e = 1.10 to 
r_6 = 2.43 for the same distribution of sources and (5 = —1, 
would require an additional surface mass density even at 
near-optimal distance, ~ 4kpc, of some 1540 M0/ pc^, com- 



parable to the luminous mass density already present in the 
NIR mass model (3636 M0/ pc^). This may suggest a prob- 
lem in the interpretation of the DIA measurement, for ex- 
ample, in the correction for amplification bias. 

We end this section by commenting on the microlensing 
contribution of the MW's dark halo. The NIR models with 
the quoted M/ Ll reproduce the Galactic terminal velocity 
curve out to > 5 kpc without inclusion of a dark halo (Bis- 
santz, Englmaier & Gerhard, in preparation). If the LSR 
circular spee d is V c = 220km/s (consiste nt wit h Ro = 8 kpc, 
Reid et al. ( |l99g| ). Backer & Sramek ( |l999| )), some dark 
matter is required between 5 kpc and the solar radius, but 
most of this will be at high latitudes, while the line-of-sight 
to Baade's window, for example, is within one disk scale- 
height zo in this range of Galactocentric radii. Decreasing 
the amount of luminous mass in the inner Galaxy in favour 
of dark matter also does not help even if the dark matter 
microlenses; this case is included in the ~ 20% uncertainty 
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Figure 17. Microlensing optical depth of our reference model at 
the longitudes of the newly published MACHO results, plotted as 
function of galactic latitude. The observations are indicated in the 
figure. The upper curve shows the optical depth for clump giant 
sources, the lower curve for all sources. Both curves are for the 
galactic longitude of the published observations for the respective 
group of sources. 



discussed above. Mo reover, from the LMC microlensing re- 
sults (Alcock et al. (2000b)) we know that at most a small 
fraction of this dark matter would actually microlense, so 
that this would likely decrease the predicted optical depth 
towards the bulge. 



6 SUMMARY AND CONCLUSIONS 

We have developed a penalized maximum likelihood algo- 
rithm that enables us to non-parametrically estimate lumi- 
nosity densities from surface brightness data. In testing this 
algorithm with artificial data we found a degeneracy in the 
bar shape for fixed bar angle, which essentially comes from 
noise in the data. This implies the existence of a sequence of 
parametric and non-parametric models that within a given 
noise level in the data can not be distinguished. For the noise 
level typical of the COBE/DIRBE data, this sequence cor- 
responds to an uncertainty of the in-plane bar elongation r; 
of ±0.1 and a corresponding variation in the half mass ra- 
dius ro.5 of the bar/bulge of ±20%. However, we show that 
the degeneracy between models on this sequence can be bro- 
ken by comparing them with the line-of-sight distributions 
of clump giant stars. 

We have non-parametrically estimated luminosity dis- 
tributions for the COBE data, including a model for spiral 
structure in the disk. This is done in two steps. First we 
fit a parametric model to the data, then we improve this 
with the non-parametric algorithm. The initial mod el con - 
tains a spiral arm term proposed by Ortiz & Lepine (1993), 



which is also used as a prior in a penalty term that is added 
to the likelihood function in subsequent iterations. Models 
with spiral arms do not have the unrealistic finger-to-Sun 
features that are commonly seen in models with an axisym- 
metric disk, and at the same time fit the surface brightness 
maps equally well. 

We have considered a sequence of models with varying 
bar angles lOdeg < ipha.r < 44deg. We evaluate these using 



both a criterion measuring a combination of the goodness- 
of-fit to the COBE data and the intrinsic smoothness of 
the luminosity distribution, and the degree to which they 
account for the asymmetry in th e clu m p gian t line-of-sight 
distributions from Stanek et al. ((1994), (1997)). In this way 
we find a preferred range 15deg < (^bar < 30 deg, with the 
best models found for 20 deg < vsbar ^ 25 deg. In our refer- 
ence (pha.1 ~ 20 deg model, the length of the bar is ~ 3.5 kpc, 
and its axis ratios are 10 : 3 — 4 : 3. The in-plane elongation 
is larger than in previous eightfold symmetric luminosity 
distributions, because spiral arms make the model appear 
broader on the sky, thereby requiring a more elongated bar 
for fixed surface brightness data. The more elongated bar in 
turn increases the asymmetry in the peak distances of the 
model's clump giant line-o f-sigh t distrib utions in the fields 
observed by Stanek et al. ((1994), (1997)), enabling the new 
model to reproduce these observations well. 

Analysing a model with two spiral ar ms ins tead of the 
four-armed structure of Ortiz & Lepine (1995), we have 
concluded that our results regarding the structure of the 
bar/bulge structure and the fit to the clump giant line-of- 
sight observations do not depend significantly on the as- 
sumed spiral arm model, as long as the spiral arm tangent 
points as seen from the Sun are similar. 

The microlensing optical depth in Baade's window for 
our reference model is r_6 ~ 0.95 for all sources and 
r_fi ~ 1.39 for clump giant sources only, when the NIR mass- 
to- light ratio is assumed to be constant and is determined by 
fitting to the Galactic terminal velocity curve (maximal disk 
model, Bissantz, Englmaier & Gerhard, in preparation). For 
clump giant sources at {l,b) = (3.9 deg, —3.8 deg) we find 
r_6 = r/10"® = 1.27, within l.Scr of the new MACH O mea - 
surement r_6 = 2.0 ± 0.4 given by Popowski et al. (2000). 
The value for all sources at {I, b) = (2.68 deg, —3.35 deg) is 
r_6 — 1.1, still > 3(7 away from the published MACHO 
DIA value r_6 = 2.43to.38- The dispersion of these r_6 val- 
ues within our models is ~ 10%. Because the NIR model 
is a good representation for the distribution of microlensing 
sources, the predicted values can only be modified signifi- 
cantly if the distribution of lenses is different from that of the 
sources. This, however, is constrained because of the good fit 
of the predicted model terminal curve to the Galactic termi- 
nal cu rve. As we have previously estimated (Bissantz et al. 
(1997)), this makes it difficult to increase the predicted op- 
tical depths by > 20%. Thus the MW disk and bulge must 
have near-maximal mass-to-light ratio to explain even the 
clump giant value for the optical depth. As Binney et al. 
( ^000| ) have argued, optical depths as high as the DIA value 
are difficult to obtain by any model that is constrained by 
the Galactic rotation curve and local disk density. 
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